Application of machine learning and visualization of heterogeneous datasets to uncover relationships between translation and developmental stage expression of C. elegans mRNAs.

نویسندگان

  • Marjan Trutschl
  • Tzvetanka D Dinkova
  • Robert E Rhoads
چکیده

The relationships between genes in neighboring clusters in a self-organizing map (SOM) and properties attributed to them are sometimes difficult to discern, especially when heterogeneous datasets are used. We report a novel approach to identify correlations between heterogeneous datasets. One dataset, derived from microarray analysis of polysomal distribution, contained changes in the translational efficiency of Caenorhabditis elegans mRNAs resulting from loss of specific eIF4E isoform. The other dataset contained expression patterns of mRNAs across all developmental stages. Two algorithms were applied to these datasets: a classical scatter plot and an SOM. The outputs were linked using a two-dimensional color scale. This revealed that an mRNA's eIF4E-dependent translational efficiency is strongly dependent on its expression during development. This correlation was not detectable with a traditional one-dimensional color scale.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

شناسایی RNA های غیرکدکننده کوتاه ‌عملکردی با استفاده از روش های بیوانفورماتیکی در گوسفند و بز

MicroRNAs (miRNAs) are small non-coding RNAs that have functional roles in post-transcriptional modification. They regulate gene expression by an RNA interfering pathway through cleavage or inhibition of the translation of target mRNA. Numerous miRNAs have been described for their important functions in developmental processes in numerous animals, but there is limited information about sheep an...

متن کامل

Mechanism and regulation of translation in C. elegans.

C. elegans represents a favorable system to study the extraordinarily complicated process of eukaryotic protein synthesis, which involves over 100 RNAs and over 200 polypeptides just for the core machinery. Initial research in protein synthesis relied on fractionated mammalian and plant systems, but in the mid-1970s, the powerful genetics of Saccharomyces cerevisiae began to yield new insights ...

متن کامل

Application of ensemble learning techniques to model the atmospheric concentration of SO2

In view of pollution prediction modeling, the study adopts homogenous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of Sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...

متن کامل

P-144: Glucose-6-Phosphate Dehydrogenase Activity in Ovine Oocytes in Association with Developmental Characteristics and Expression of Bax Gene

Background: Recent studies have revealed that oocyte competence can be assessed through the presence of the glucose-6-phosphate dehydrogenase (G6PDH) enzyme, as indicated by brilliant cresyl blue (BCB), a dye that can be degraded by G6PDH. In the present study, we aimed to examine the validity of BCB test to select developmentally competent oocyte in ovine and its association with stage-specifi...

متن کامل

Mammalian Eye Gene Expression Using Support Vector Regression to Evaluate a Strategy for Detecting Human Eye Disease

Background and purpose: Machine learning is a class of modern and strong tools that can solve many important problems that nowadays humans may be faced with. Support vector regression (SVR) is a way to build a regression model which is an incredible member of the machine learning family. SVR has been proven to be an effective tool in real-value function estimation. As a supervised-learning appr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Physiological genomics

دوره 21 2  شماره 

صفحات  -

تاریخ انتشار 2005